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4 <110> APPLICANT: University of Wales College of Medicine 
6 <120> TITLE OF INVENTION: Protein and DNA coding therefor 
8 <130> FILE REFERENCE: PCT/GB99/03 654 
C--> 10 <140> CURRENT APPLICATION NUMBER: US/09/831, 14 2A 
C--> 11 <141> CURRENT FILING DATE: 2001-05-07 
13 <160> NUMBER OF SEQ ID NOS : 22 
15 <170> SOFTWARE: Patentln Ver. 2.1 

17 <210> SEQ ID NO: 1 

18 <211> LENGTH: 870 

19 <212> TYPE: DNA 

20 <213> ORGANISM: Pholas dactylus 

22 <400> SEQUENCE: 1 

23 gaattcggca cgagtcggaa aagaacaaaa tggcttgtat cgttttcgtt gctcttgtcg 60 
25 ctctatgctt aatgcaaccg ggttccggtg aggaagtaca atgcgcgatg aattggacac 120 
27 aagctaatga atatgtgttc aacgtggact ggatgaccat tttcatctac gactatggcg 180 
29 ctcaagagca actgtacgaa gatcgggctt tggggctgtg tcggattgaa cgggccggcc 240 
31 caggtaccac aaaagccgtc tggattaact ggagtaacga cacgcagtca tgtgtaacaa 300 
33 gaaaaacaat cttcttcgag gttggtggag aaattgcccg gctagttgac tacagaccac 360 
35 aggaagacgg aactgagaaa acttttacaa gaaaattctc tagcaaaatg ccaggcactt 420 
37 acatgcttat ggacgtgtgc gctacaaggg acgctgatga taaatgcatc gaaggcacaa 480 
39 ttgtggtgac agtcagggtg tccctatatg acgaagataa caatggtgta atggatgaag 540 
41 gtaaggtgat tccatctgag acaatcgagg atgatatcaa ggactgtggg ctcttagacc 600 
43 aagatgttga actcgattat acgtggactc aaaacgagtg tgatctacca gacacagtag 660 
45 acgaggctga agacacaccg tcagaaactg gagaattctt ctggtagatc tatcagacta 720 
47 cttttatcag caggacaact ggtcgttacc agacacctat aacgtgtcct catcaataat 780 
50 gtgtaaaaca gaaataatcg atagaatatt gaaaataaaa tgttaataaa cactggttga 840 
52 aatatgaaaa aaaaaaaaaa aaaactcgag 870 

56 <210> SEQ ID NO: 2 

57 <211> LENGTH: 816 

58 <212> TYPE: DNA 

59 <213> ORGANISM: Pholas dactylus 

61 <4 00> SEQUENCE: 2 

62 gaattcggca cgagggaaaa gaacaaaatg gcttgtatcg ttttcgttgc tcttgtcgct 60 
64 ctatgcttaa tgcaaccggg ttccggtgag gaagtacaat gcgcgatgaa ttggacacaa 120 
66 gctaatgaat atgtgttcaa cgtggactgg atgaccattt tcatctacga ctatggcgct 180 
68 caagagcaac tgtacgagga tcgggctttg gggctgtgtc ggattgaacg ggccggccca 240 
70 ggtaccacaa aagccgtctg gattaactgg agtaacgaca cgcagtcatg tgtaacaaga 300 
72 aaaacaatct tcttcgaggt tggtggagaa attgcccggc tagttgacta cagaccacag 360 
74 gaagacggaa ctgagaaaac ttttacaaga aaattctcta gcaaaatgcc aggcacttac 420 
76 atgcttatgg acgtgtgcgc tacaagggac gctgatgata aatgcatcga aggcacaatt 480 
78 gtggtgacag tcagggtgtc cctatatgac gaagataaca atggtgtaat ggatgaaggt 540 
80 aaggttattc catctgagac aatcgaggat gatatcaagg actgtgggct cttagaccaa 600 
82 gatgttgaac tcgattatac gtggactcaa aacgagtgtg atctaccaga cacagtagac 660 



ENTERED 
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84 gaggctgaag acacaccgtc agaaactgga gaattcttct ggtagatcta tcagaccact 720 

86 tttatcagca ggacaactgg tcgttaccag acacctataa cgtgtcctca tcaataatgt 780 

88 gtaaaacaga aataatcgat agaatattga aaataa 816 

92 <210> SEQ ID NO: 3 

93 <211> LENGTH: 852 

94 <212> TYPE: DNA 

95 <213> ORGANISM: Pholas dactylus 

97 <400> SEQUENCE: 3 

98 gtcggaaaag aacaaaatgg cttgtatcgt tttcgttgct cttgtcgctc tatgcttaat 60 
101 gcaaccgggt tccggtgagg aagtacaatg cgcgatgaat tggacacaag ctaatgaata 120 
103 tgtgttcaac gtggactgga tgaccatttt catctacgac tatggcgctc aagagcaact 180 
105 gtacgaggat cgggctttgg ggctgtgtcg gattgaacgg gccggcccag gtaccacaaa 240 
107 agccgtctgg attaactgga gtaacgacac gcagtcatgt gtaacaagaa aaacaatctt 300 
109 cttcgaggtt ggtggagaaa ttgcccggct agttgactac agaccacagg aagacggaac 360 
111 tgagaaaact tttacaagaa aattctctag caaaatgcca ggcacttaca tgcttatgga 420 
113 cgtgtgcgct acaagggacg ctgatgataa atgcatcgaa ggcacaattg tggtgacagt 480 
115 cagggtgtcc ctatatgacg aagataacaa tggtgtaatg gatgaaggta aggttattcc 540 
117 atctgagaca atcgaggatg atatcaagga ctgtgggctc ttagaccaag atgttgaact 600 
119 cgattatacg tggactcaaa acgagtgtga . tctaccagac acagtagacg aggctgaaga 660 
121 cacaccgtca gaaactggag aattcttctg gtagatctat cagaccactt ttatcagcag 720 
123 gacaactggt cgttaccaga cacctataac gtgtcctcat caataatgtg taaaacagaa 780 
125 ataatcgata gaatattgaa aataaaatgt taatagacac tggttgaaaa aaaaaaaaaa 840 
127 aaaaaactcg ag 852 

131 <210> SEQ ID NO: 4 

132 <211> LENGTH: 225 

133 <212> TYPE: PRT 

134 <213> ORGANISM: Pholas dactylus 
136 <400> SEQUENCE: 4 



137 


Met 


Ala 


Cys 


He 


Val 


Phe 


Val 


Ala 


Leu 


Val 


Ala 


Leu 


Cys 


Leu 


Met 


Gin 


138 


1 








5 










10 










15 




140 


Pro 


Gly 


Ser 


Gly 


Glu 


Glu 


Val 


Gin 


Cys 


Ala 


Met 


Asn 


Trp 


Thr 


Gin 


Ala 


141 








20 










25 










30 






143 


Asn 


Glu 


Tyr 


Val" 


Phe 


Asn 


Val 


Asp 


Trp 


Met 


Thr 


He 


Phe 


He 


Tyr 


Asp 


144 






35 










40 










45 








146 


Tyr 


Gly 


Ala 


Gin 


Glu 


Gin 


Leu 


Tyr 


Glu 


Asp 


Arg 


Ala 


Leu 


Gly 


Leu 


Cys 


147 




50 










55 










60 










149 


Arg 


He 


Glu 


Arg 


Ala 


Gly 


Pro 


Gly 


Thr 


Thr 


Lys 


Ala 


Val 


Trp 


He 


Asn 


151 


65 










70 










75 










80 


153 


Trp 


Ser 


Asn 


Asp 


Thr 


Gin 


Ser 


Cys 


Val 


Thr 


Arg 


Lys 


Thr 


He 


Phe 


Phe 


154 










85 










90 










95 




156 


Glu 


Val 


Gly 


Gly 


Glu 


He 


Ala 


Arg 


Leu 


Val 


Asp 


Tyr 


Arg 


Pro 


Gin 


Glu 


157 








100 










105 










110 






159 


Asp 


Gly 


Thr 


Glu 


Lys 


Thr 


Phe 


Thr 


Arg 


Lys 


Phe 


Ser 


Ser 


Lys 


Met 


Pro 


160 






115 










120 










125 








162 


Gly 


Thr 


Tyr 


Met 


Leu 


Met 


Asp 


Val 


Cys 


Ala 


Thr 


Arg 


Asp 


Ala 


Asp 


Asp 


163 




130 










135 










140 










165 


Lys 


Cys 


He 


Glu 


Gly 


Thr 


He 


Val 


Val 


Thr 


Val 


Arg 


Val 


Ser 


Leu 


Tyr 


166 


145 










150 










155 










160 


168 


Asp 


Glu 


Asp 


Asn 


Asn 


Gly 


Val 


Met 


Asp 


Glu 


Gly 


Lys 


Val 


He 


Pro 


Ser 



file://C:\Crf3\Outhold\VsrI83 1 1 42A.htm 



3/15/02 



RAW SEQUENCE LISTING DATE: 03/15/2002 

PATENT APPLICATION: US/09/831, 142A TIME: 11:08:09 



Input Set : A:\es.txt 

Output Set: N:\CRF3\03152002\I831142A.raw 



1 fiQ 






165 










1 70 

1 / U 










17 5 

1 / j 




171 


Glu Thr lie 


Glu 


Asp 


Asp 


He 


Lys 




v^jr O 


Gl v 


ueix 


T.Pn 
jje li 


A an 
nop 


Rl n 


A an 
nop 


172 




180 










185 

X \J *J 










1 QO 






174 


Val Glu Leu 


Asp 


Tyr 


Thr 


Trp 


Thr 


G1 n 


noil 






Son 
nop 


uc u. 


Pro 
riu 


& an 
nop 


1 75 


195 










200 










905 








1 77 

X / f 


Thr Val Asp 


Glu 


Ala 


Glu 


Asp 


Thr 


prn 


Oei 


ulU 


Thr 

j. in 


(11 v 
ul jr 




php 
rile 


php 

irne 


1 7ft 


210 








215 










990 
£i \j 










180 


Trp 




























1 R1 


225 




























184 


<210> SEQ ID NO 


: 5 
























X O *J 
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318 <213> ORGANISM: Pholas dactylus 

320 <220> FEATURE: 

321 <221> NAME/KEY: modif ied_base 

322 <222> LOCATION: (3) 

323 <223> OTHER INFORMATION: i 
325 <400> SEQUENCE: 9 

W--> 326 tcngtnccyt cytcytg 17 

330 <210> SEQ ID NO: 10 

331 <211> LENGTH: 18 

332 <212> TYPE: DNA 

333 <213> ORGANISM: Pholas dactylus 

335 <220> FEATURE: 

336 <221> NAME/KEY: modif ied_base 

337 <222> LOCATION: (9) 

338 <223> OTHER INFORMATION: i 
340 <400> SEQUENCE: 10 

W--> 341 ttyaaygtng aytggatg 18 

345 <210> SEQ ID NO: 11 

346 <211> LENGTH: 20 

347 <212> TYPE: DNA 

348 <213> ORGANISM: Pholas dactylus 

351 <400> SEQUENCE: 11 

352 acacagcccc aaagcccgat 20 

356 <210> SEQ ID NO: 12 

357 <211> LENGTH: 20 

358 <212> TYPE: DNA 

359 <213> ORGANISM: Pholas dactylus 

361 <400> SEQUENCE: 12 

362 ttgcccggct agttgactac 20 

366 <210> SEQ ID NO: 13 

367 <211> LENGTH: 24 

368 <212> TYPE: DNA 

369 <213> ORGANISM: Pholas dactylus 

371 <400> SEQUENCE: 13 

372 catatttcaa ccagtgttta ttaa 24 

376 <210> SEQ ID NO: 14 

377 <211> LENGTH: 19 

378 <212> TYPE: DNA 

379 <213> ORGANISM: Pholas dactylus 

381 <400> SEQUENCE: 14 

382 caattgtgcc ttcgatgca 19 

386 <210> SEQ ID NO: 15 

387 <211> LENGTH: 17 

388 <212> TYPE: DNA 

389 <213> ORGANISM: Pholas dactylus 

391 <400> SEQUENCE: 15 

392 ggactgtggg ctcttag 1*7 

396 <210> SEQ ID NO: 16 

397 <211> LENGTH: 20 
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Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:7; N Pos . 3 
Seq#:8; N Pos. 12,15 
Seq#:9; N Pos. 3,6 
Seq#:10; N Pos, 9 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/831, 14 2A 



DATE: 03/15/2002 
TIME: 11:08:10 



Input Set : A:\es.txt 

Output Set: N:\CRF3\03152002\I831142A, raw 



L:10 M:270 C: Current Application Number differs, Replaced Current Application Number 

L:ll M:271 C: Current Filing Date differs/ Replaced Current Filing Date 

L:295 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 7 after pos . : 0 

L:311 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 8 after pos . : 0 

L:326 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 9 after pos . : 0 

L:341 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 10 after pos . : 0 
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